SEQUENCE LISTING



<110> The Government of the United States of America, as represented by 
the Secretary, Department of Health and Human Services, 
c/o Centers for Disease Control and Prevention 

Khudyakov, Yury
Fields, Howard

<120> MOSAIC PROTEIN AND RESTRICTION ENDONUCLEASE ASSISTED
LIGATION METHOD FOR MAKING THE SAME

<130> 14114.0344U2

<141> 01-25-00

<140> 09/491,146




<150> 08/921,887 

<151> 25-08-97 

<160> 55 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 55 

<212> DNA 

<213> Hepatitis C virus 

<400> 1 

ccccgaattc aaccgaaacc gcaacgtaaa accaaacgta acaccattcg tcgtc 55 

<210> 2 

<211> 69 

<212> DNA 

<213> Hepatitis C virus 

<400> 2 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacgaat 69 

<210> 3 

<211> 54 

<212> DNA 

<213> Hepatitis C virus 

<400> 3 

ccccgaattc aaccgaaacc gcaacgtcag accaaacgta acaccaaccg tcgt 54 

<210> 4 

<211> 70 

<212> DNA 

<213> Hepatitis C virus 
<400> 4

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggttg 70 

<210> 5 
<211> 51 
<212> DNA 

<213> Hepatitis C virus 
<400> 5 

ccccgaattc aaccgaaacc gcaacgtaaa accaaacgta acacctaccg t 51 

<210> 6 
<211> 73 
<212> DNA 

<213> Hepatitis C virus 
<400> 6 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggtag gtg 73

<210> 7 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 7 

ccccgaattc aaccgaaacc gcaacgtaaa ccgaaccgta acaccaaccg tcgtc 55 

<210> 8 
<211> 69 
<212> DNA 

<213> Hepatitis C virus 
<400> 8 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggtt 69 

<210> 9 
<211> 63 
<212> DNA 

<213> Hepatitis C virus 
<400> 9 

ccccgaattc aaccgaaacc gcaacgtcag ccgaaacgta acaccccgcg tcgtccgcag 60 
gac 63 

<210> 10 
<211> 60 
<212> DNA 

<213> Hepatitis C virus 
<400> 10

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60



<210> 11 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 11 

ccccgaattc aaccgaaacc gcaacgtaaa accaaacgta acgctcaccg tcgtc 55 

<210> 12 
<211> 69 
<212> DNA 

<213> Hepatitis C virus 
<400> 12 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggtg 69 

<210> 13 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 13 

ccccgaattc aaccgaaacc gcaacgtaaa aaccagcgta acaccaaccg tcgtc 55 

<210> 14 
<211> 69 
<212> DNA 

<213> Hepatitis C virus 
<400> 14 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggtt 69 

<210> 15 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 15 

ccccgaattc aaccgaaacc gcaacgtaaa accaaacgta acaccattcg tcgtc 55 

<210> 16 
<211> 69 
<212> DNA 

<213> Hepatitis C virus 



<400> 16 
ccccggatcc tatttcggaa cgtagataac accaccaccc gggaatttaa cgtcctgcgg 60
acgacgaat 69



<210> 17 
<211> 56 
<212> DNA 

<213> Hepatitis C virus 
<400> 17 

ccccgaattc aaccgaaacc gcaacgtaaa accgaacgta acaccaaccg tcgtcc 56 

<210> 18 
<211> 65 
<212> DNA 

<213> Hepatitis C virus 
<400> 18 

ccccggatcc tatttcggac caacgatctg accaccacca gagaaacgaa cgtccggacg 60 
acggt 65 

<210> 19 
<211> 54 
<212> DNA 

<213> Hepatitis C virus 
<400> 19 

ccccgaattc aaccgaaacc gaaacgtcag accaaacgta acaccctgcg tcgt 54 

<210> 20 
<211> 73 
<212> DNA 

<213> Hepatitis C virus 
<400> 20 

ccccggatcc tatttcggac caacgatctg accaccagcc gggaatttaa cgtttttcgg 60 
acgacgacgc agg 73

<210> 21 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 21 

ccccgaattc aaccgaaacc gcaacgtaaa accaaacgta aagctcaccg tcgtc 55 

<210> 22 

<211> 69 

<212> DNA 

<213> Hepatitis C virus 



<400> 22 

ccccggatcc tatttcggac caacgatctg accaccaccc gggaatttaa cgtcctgcgg 60 
acgacggtg 69

<210> 23
<211> 28 
<212> PRT 

<213> Hepatitis C virus 



<400> 23 

Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Ile Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 24 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 



<400> 24 

Pro Lys Pro Gln Arg Gln Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25



<210> 25 

<211> 28 

<212> PRT 

<213> Hepatitis C virus 



<400> 25 

Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Tyr Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 26 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 
<400> 26 

Pro Lys Pro Gln Arg Lys Pro Asn Arg Asn Thr Asn Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 27 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 
<400> 27 

Pro Lys Pro Gln Arg Gln Pro Lys Arg Asn Thr Pro Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 28
<211> 28

<212> PRT 

<213> Hepatitis C virus 
<400> 28 

Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Ala His Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 29 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 
<400> 29 

Pro Lys Pro Gln Lys Arg Asn Gln Arg Asn Thr Asn Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 30 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 
<400> 30 

Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Ile Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Val Ile Tyr Val
20 25

<210> 31 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 
<400> 31 

Pro Lys Pro Gln Arg Lys Thr Glu Arg Asn Thr Asn Arg Arg Pro Gln

1 5 10 15

Asp Val Arg Phe Ser Gly Gly Gly Gln Ile Val Gly
20 25

<210> 32 
<211> 28 
<212> PRT 

<213> Hepatitis C virus 



<400> 32 

Pro Lys Pro Lys Arg Gln Thr Lys Arg Asn Thr Leu Arg Arg Pro Lys

1 5 10 15

Asn Val Lys Phe Pro Ala Gly Gly Gln Ile Val Gly
20 25

<210> 33
<211> 28
<212> PRT

<213> Hepatitis C virus

<400> 33 

Pro Lys Pro Gln Arg Lys Thr Lys Arg Lys Ala His Arg Arg Pro Gln

1 5 10 15

Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25

<210> 34 
<211> 82 
<212> DNA 

<213> Hepatitis C virus 
<400> 34 

ccccgaattc aagccgccca cataccatac ctagaacaag gaatgcatct cgcagaacaa 60 
ttcaaacaaa aggcacttcg tc 82 

<210> 35 
<211> 84 
<212> DNA 

<213> Hepatitis C virus 
<400> 35 

ccccggatcc taactagcct cttccatctc atcaaactcc tgatacaaaa cctccctatc 60 
cgggataaca gccggacgaa gtgc 84 

<210> 36 
<211> 85 
<212> DNA 

<213> Hepatitis C virus 
<400> 36 

ccccgaattc aagctagtca cttaccgtat atcgagcagg gaatgcagtt agctgaacag 60 
tttaaacaga aggctctggc ttttg 85

<210> 37 
<211> 81 
<212> DNA 

<213> Hepatitis C virus 
<400> 37 

ccccggatcc taaggccgag cgtcagactc aggaacataa tgagtaggag aaacatgatt 60 
accccgagaa gcaaaagcca g 81 

<210> 38 
<211> 81 
<212> DNA 

<213> Hepatitis C virus 
<400> 38 

ccccgaattc aacggcctgc gataataccg gatagggagg ttcttcatag ggagtttgac 60 
gagatggagg aggcttttgc g 81 



<210> 39 
<211> 79

<212> DNA

<213> Hepatitis C virus 



<400> 39 

ccccggatcc tactgcgaag catcagactc aggaacataa tgagccggac taacatgatt 60 
cccacgagac gcaaaagcc 79 



<210> 40 

<211> 83 

<212> DNA 

<213> Hepatitis C virus 



<400> 40 

ccccgaattc aatcgcaggc ggcgccttat attgagcagg ctcaggttat tgctcatcag 60 
tttaaggaga aggttcttgc ttt 83 



<210> 41 

<211> 83 

<212> DNA 

<213> Hepatitis C virus 



<400> 41 

ccccggatcc tacggcttcg cgtccgactc aggaacataa tgagtcggag aatcatgatt 60 
accacgagaa gcaaaagcaa gaa 83 



<210> 42 

<211> 79 

<212> DNA 

<213> Hepatitis C virus 



<400> 42 

ccccgaattc aaaagccggc gataatccct gaccgtgagg ttctgtatcg tgagtttgat 60 
gagatggagg agtcacagc 79 

<210> 43 
<211> 87 
<212> DNA 

<213> Hepatitis C virus 
<400> 43 

ccccggatcc taaaacgcca gagccttctg cttaaactgc tcagcaagca tcataccctg 60 
ctcaatgtac ggaagatgct gtgactc 87 

<210> 44 
<211> 76 
<212> DNA 

<213> Hepatitis C virus 



<400> 44 

ccccgaattc aagcgtttgc ttctcgtggt aatcatgttg ctccgactca ttatgttacg 60
gagtcagatg ctaagc 76

<210> 45
<211> 84
<212> DNA

<213> Hepatitis C virus 

<400> 45 

ccccggatcc tagaaagcct cctccatctc atcatactgc tgataaagaa cctccttatc 60 
cggaaccaga gccggcttag catc 84 

<210> 46 
<211> 80 
<212> DNA 

<213> Hepatitis C virus 
<400> 46 

ccccgaattc aagctttcgc ttctcgtggt aatcatgttg ctcctacgca ttatgttgtt 60 
gagtcagatg cttctgcttc 80 

<210> 47 
<211> 86 
<212> DNA 

<213> Hepatitis C virus 
<400> 47 

ccccggatcc tagaaagcca gaaccttctc cttaaactga ccagcaatag cacgcgtctc 60 
gtccatatac ggcagagaag cagaag 86

<210> 48 
<211> 80 
<212> DNA 

<213> Hepatitis C virus 
<400> 48 

ccccgaattc aagctttcgc tagtcgtggg aatcatgtgt cgccgcgtca ttatgtgcct 60 
gagtctgagc ctcaggttgt 80 

<210> 49 
<211> 80 
<212> DNA 

<213> Hepatitis C virus 
<400> 49 

ccccggatcc taagaagcct cctccatctc atcaaaagcc tcatacagta tctccttatc 60 
cggcgtaaca acaacctgag 80 

<210> 50 
<211> 52 
<212> DNA 

<213> Hepatitis C virus 
<400> 50 

ccccgaattc aagcttctaa ggccgcgctg attgaggagg gtcagcgtat gg 52

<210> 51
<211> 48
<212> DNA 



<213> Hepatitis C virus 
<400> 51 

ccccggatcc tactggatct tagacttcag catctcagcc atacgctg 48 

<210> 52 
<211> 352 
<212> PRT 

<213> Hepatitis C virus 
<400> 52 

Ala Ala His Ile Pro Tyr Leu Glu Gln Gly Met His Leu Ala Glu Gln
1 5 10 15

Phe Lys Gln Lys Ala Leu Arg Pro Ala Val Ile Pro Asp Arg Glu Val
20 25 30

Leu Tyr Gln Glu Phe Asp Glu Met Glu Glu Ala Ser His Leu Pro Tyr
35 40 45

Ile Glu Gln Gly Met Gln Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu
50 55 60

Ala Phe Ala Ser Arg Gln Asn His Val Ser Pro Thr His Tyr Val Pro
65 70 75 80

Glu Ser Asp Ala Arg Pro Ala Ile Ile Pro Asp Arg Glu Val Leu His
85 90 95

Arg Glu Phe Asp Glu Met Glu Glu Ala Phe Ala Ser Arg Gly Asn His
100 105 110

Val Ser Pro Ala His Tyr Val Pro Glu Ser Asp Ala Ser Gln Ala Ala
115 120 125

Pro Tyr Ile Glu Gln Ala Gln Val Ile Ala His Gln Phe Lys Glu Lys
130 135 140

Val Leu Ala Phe Ala Ser Arg Gly Asn His Asp Ser Pro Thr His Tyr
145 150 155 160

Val Pro Glu Ser Asp Ala Lys Pro Ala Ile Ile Pro Asp Arg Glu Val
165 170 175

Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Ser Gln His Leu Pro Tyr
180 185 190

Ile Glu Gln Gly Met Met Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu
195 200 205

Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr
210 215 220

Glu Ser Asp Ala Lys Pro Ala Leu Val Pro Asp Lys Glu Val Leu Tyr
225 230 235 240

Gln Gln Tyr Asp Glu Met Glu Glu Ala Phe Ala Ser Arg Gly Asn His
245 250 255

Val Ala Pro Thr His Tyr Val Val Glu Ser Asp Ala Ser Ala Ser Leu
260 265 270

Pro Tyr Met Asp Glu Thr Arg Ala Ile Ala Gly Gln Phe Lys Glu Lys
275 280 285

Val Leu Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Arg His Tyr
290 295 300

Val Pro Glu Ser Glu Pro Gln Val Val Val Thr Pro Asp Lys Glu Ile
305 310 315 320

Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Ala Ser Lys Ala Ala Leu
325 330 335

Ile Glu Glu Gly Gln Arg Met Ala Glu Met Leu Lys Ser Lys Ile Gln
340 345 350



<210> 53 
<211> 57 
<212> DNA 

<213> Hepatitis C virus 
<400> 53 

ctggttccgc gtggatcccc aggaattccc gggtcgactc gagcggccgc atcgtga 57 

<210> 54 
<211> 27 
<212> DNA 

<213> Hepatitis C virus 
<400> 54 

tcgcagcgaa ttctcgagga tccatcc 27 

<210> 55 
<211> 55 
<212> DNA 

<213> Hepatitis C virus 
<400> 55 

ctggttccgc gtggatcgca gcgaattctc gaggatccat ccggccgcat cgtga 55 



